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Orotidine-5 '-phosphate decarboxylase gene, gene construct 
comprising this gene and its use 

5 The invention relates to an orotidine-5 '-phosphate decarboxylase 
gene having the sequence SEQ ID No. 1 or its homologs, to a gene 
construct comprising this gene or its homologs, and to its use. 
The invention additionally relates to vectors or organisms 
comprising an orotidine-5 ' -phosphate decarboxylase gene having 
10 the sequence SEQ ID No. 1 or its homologs. 

The invention further relates to a process for producing 
uracil-auxotrophic microorganisms and to a process for inserting 
DNA into uracil-auxotrophic microorganisms. 

15 

Vitamin B2, also called riboflavin, is essential for humans and 
animals. Vitamin B2 deficiency is associated with inflammations 
of the mucosa of the mouth and throat, pruritus and inflammations 
in skin folds and similar cutaneous lesions, conjunctival 

20 inf lammations, reduced visual acuity and clouding of the cornea. 
In babies and children, cessation of growth and weight loss may 
occur. Vitamin B2 therefore has economic importance in particular 
as vitamin supplement in cases of vitamin deficiency and as 
animal feed supplement. It is additionally used as food color, 

25 for example in mayonnaise, ice cream, blancmange etc.. 

Vitamin B2 is prepared either chemically or microbially (see, for 
example, Kurth et al., 1996, Riboflavin, in: Ullmann's 
Encyclopedia of industrial chemistry, VCH Weinheim) . In the 

3o chemical preparation processes, riboflavin is usually obtained as 
pure final product in multistage processes, it being necessary to 
employ relatively costly starting materials such as, for example, 
D-ribose. An alternative to the chemical synthesis of riboflavin 
is the preparation of this substance by microorganisms. The 

35 startin 9 materials used in this case are renewable raw materials 
such as sugars or vegetable oils. The preparation of riboflavin 
by fermentation of fungi such as Eremothecium ashbyii or Ashbya 
gossypii is known (The Merck Index, Windholz et al., eds. Merck & 
Co., page 1183, 1983), but yeasts such as, for example, Candida, 
Pichia and Saccharomyces or bacteria such as, for example, 

*° Bacillus, Clostridia or corynebacteria have also been described 
as riboflavin producers. 

DE 44 20 785 describes six riboflavin biosynthesis genes from 
Ashbya gossypii, and microorganisms which have been transformed 
^5 with these genes, and the use of such microorganisms for 
riboflavin synthesis . 
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To date, genes have been inserted into fungal riboflavin 
producers such as Ashbya gossypii via the markers leu2 (leucine 
auxotrophy), thr4 (threonine auxotrophy) or kan (kanamycin 
resistance) (WO 92/00379). A further marker described in yeasts 
5 is metis (methionine auxotrophy, Cost et al., Yeast, Vol, 12, 
1996: 939 - 941). The disadvantage of this marker is that either 
the transformation efficiency is very low and/or antibiotics must 
be continuously added for the selection. However, in each case, 
counterselection for loss of the marker with retention of the 

10 inserted genes in microorganisms is impossible or possible only 
with very great effort, so that it is usually no longer possible 
to insert further genes with these markers into the 
microorganisms. It is therefore desirable to have a selection 
marker which displays high transformation efficiency, is easily 

25 selectable and makes counterselection possible. 

The orotidine-5 '-phosphate decarboxylase gene (= URA3 gene) from 
Saccharomyces cerevisiae is one of the classical markers having 
the required properties and usable for transforming genes into 
microorganisms such as yeasts and fungi. The isolation of 
species-specific URA3 genes and the isolation of the 
corresponding gene from fungi (= pyrG) and the sequences thereof 
from Pichia stipitis, Candida boidinii, Kluyveromyces marxianus, 
Yamadazyma ohmeri, Candida maltosa, Aspergillus niger, 
Aspergillus oryzae, Aspergillus nidulans, Mucor circinelloides, 
Phycomyces blakesleeanus , Penicillium chrysogenum, and 
Aspergillus awamori have been described in a number of studies 
(Appl. Environ. Microbiol., Vol. 60, No. 12, 1994 : 4245 - 4254, 
Nucl. Acids Res., Vol. 18, No. 23, 1990: 7183, J. Ferment. 
Bioeng., Vol. 73, No 4, 1992: 255 - 260, Yeast, Vol. 9, 1993: 
677 - 681, Yeast, Vol. 10, 1994: 1601 - 1612, Curr. Genet., Vol. 
23, 1993: 205 - 210, Nucl. Acids Res., Vol.16, No. 5, 1988: 2339, 
Curr. Genet., Vol. 16, 1989: 159 - 163, Gene, Vol. 61, 1987: 
385 - 399, Gene, Vol. 116, 1992: 59 - 67, Mol. Gen. Genet., Vol. 
224, 1990: 269 - 278, Nucl. Acids Res., Vol. 16, No. 16, 1988: 
35 8177, Nucl. Acids Res., Vol. 18, No. 23, 1990: 7183 and Curr. 
Genet., Vol. 27, 1995: 536 - 540). 

Studies by Rose et al. (Gene, Vol. 29, 1984: 113 - 124) have 
shown that the URA3 gene from Saccharomyces cerevisiae is in fact 
40 capable of complementation of a corresponding mutation (pyrF 

gene = URA3) in prokaryotes such as Escherichia coli, and can be 
useful as selection marker. 



I 



>However, genetic stqdies on riboflavin synthesis by Ashbya 
45 gossypii (vitamin B2l synthesis) have shown that the URA3 gene 
from Saccharomyces derevisiae or the pyrF gene from Escherichia 
coli are [sic] not capable of complementation of 
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uracil-auxotrophic Ashbya gossypii mutants, and therefore these 
genes cannot be used for cloning genes into Ashbya gossypii. 




Attempts have therefore been made, because that [sic] gene from 
5 Ashbya gossypii corresponding to the URA3 gene or pyrF gene is 
unknown, to clone it. Attempts at cloning the Ashbya gene by the 
methods described in the literature via, for example, 
hybridization wi th URA3 gene fragments or via degenerate 
oligonucleotides based on conserved amino-acid sequences of 
10 various orotidin 5-5 ' -phosphate decarboxylases and screening a 
cDNA library using these oligonucleotides and the PCR technique 

(Bergkamp et al. Yeast, Vol. 9, 1993: 677 - 

Yeast, Vol. 10, 1994: 1601 - 1612, Benito et 
3\16, 1992: 59 - 67 and Diaz-Minguez et al., Mol. 
224, 1990: 269 - 278) . 



15 



were unsuccessful. 
681, Piredda et ^1. 
al., Gene, Vol, 
Gen. Genet., Vol. 



20 



It is an object of the present invention therefore to provide an 
easily selectable marker which can be transformed with high yield 
and is easily counterselectable and which makes it possible to 
insert genes into microorganisms. 



We have founA that this object is achieved by the novel 
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orotidine-5 '4 
SEQ ID NO: 1 
with the seqi 



phosphate decarboxylase [lacuna] having the sequence 
or its homologs which have at least 80% homology 
nee SEQ ID NO: 1. 



T 



35 



40 



Homologs of the novel orotidine-5/-phosphate decarboxylase gene 
having the sequence SEQ ID NO: l/mean, for example, allelic 
variants which have at least 80% homology at the derived 
amino-acid level, preferably j&t least 90% homology, very 
particularly preferably at least 95% homology. The amino-acid 
sequence derived from SEQ ZD NO: 1 is to be seen in SEQ ID NO: 
Allelic variants comprised in particular, functional variants 
which are obtainable bv/deletion, insertion or substitution of 
nucleotides from the sequence depicted in SEQ ID NO: 1, the 
intention being, however, that the enzymatic activity of the 
derived synthesized /proteins advantageously be retained for the 
insertion of one oj more genes. However, if the intention is to 
produce mutants iri the orotidine-5 ' -phosphate decarboxylase gene 
with the aid of SsEQ ID NO: 1 and its homologs in the novel 
process for producing uracil-auxotrophic microorganisms, 
non-functional/genes will be used, that is to say genes which 
lead to enzymatically inactive proteins. In this case, it is 
advantageous/ to use sequences which display homologies with SEQ 
ID NO: 1 or/ its homologs advantageously at the 3' and 5' ends* 



1. 



45 
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Homologs of SEQ ID NO: 1 additionally mean, for example, fungal 
or plant homologs, truncated sequences, single-stranded DNA or 
RNA of the coding and noncoding DNA sequence. Homologs of SEQ ID 
NO: 1 have at the DNA level a homology of at least 60%, 
5 preferably of at least 70%, particularly preferably of at least 
80%, very particularly preferably of at least 90%, over the 
complete DNA region indicated in SEQ ID NO: 1. 

Homologs of SEQ ID NO: 1 also mean derivatives such as, for 
10 example, promoter variants- The promoters upstream of the 

indicated nucleotide sequences may be modified by one or more 
nucleotide exchanges, by insertion(s) and/or deletion(s) without, 
however, the functionality or activity of the promoters being 
impaired- It is additionally possible for the promoters to have 
15 their activity increased by modifying their sequence, or to be 
completely replaced by more active promoters even from 
heterologous organisms . 

(^^i^erivatives also me&n variants whose nucleotide sequence in the 
^^20 region from -1 to ->00 in front of the start codon have [sic] 

been modified so as to alter, preferably increase, gene 

expression and/or protein expression. 

It is possible and preferred for SEQ ID NO: 1 or its homologs to 
25 be isolated from microorganisms of the family Metschnikowiaceae, 
particularly preferably from microorganisms of the genera 
Eremothecium, Ashbya or Nematospora, very particularly preferably 
from microorganisms of the genus and species Eremothecium ashbyii 
or Ashbya gossypii. 




35 



The novel gene const; 
ID No. 1 and its honi 
one or more regulato 
expression. Examples 
to which inducers or 



40 
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uct means the URA3 gene sequences [sic] SEQ 
.logs which have been functionally linked to 
y signals, advantageously to increase gene 
of these regulatory sequences are sequences 
repressors bind and thus regulate the 
expression of the nuileic acid. In addition to these novel 
regulatory sequences the natural regulation of these sequences 
in front of the actual structural genes can still be present and, 
where appropriate, have been genetically modified so that the 
natural regulation has been switched off and the expression of 
the genes has been increased. The gene construct can, however, 
also have a simpler structure, that is to say no additional 
regulatory signals have been inserted in front of the sequence 
SEQ ID No. 1 or its homologs, and the natural promoter with its 
regulation has not been deleted. Instead, the natural regulatory 
sequence has been mutiated so that regulation no longer takes 
place, and gene expression is enhanced. The gene construct may 
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additionally advantageously comprise one or more so-called 
enhancer sequences functionally linked to the promoter and making 
increased expression of the nucleic acid sequence possible. It is 
also possible to insert at the 3' end of the DNA sequences 
5 additional advantageous sequences, such as further regulatory 
elements or terminators. The URA3 genes may be present in one or 
more copies in the gene construct, and the gene or genes can also 
be inactivated. It is possible with the aid of this or these 
inactivated genes to generate uracil-auxotrophic mutants in the 

10 novel process. It is advantageous for further genes to be present 
in the gene construct for insertion of further genes into a 
microorganism. These genes may be located inside a URA3 gene, in 
which case there ought advantageously to be an intact copy of the 
URA3 gene and/or another selectable gene such as leu2, thr4 or 

15 kan present in the construct, or they can be located outside the 
URA3 gene. Even if an intact URA3 gene is present in the 
construct, further markers such as those mentioned above can, 
where appropriate, be present for selection in the gene 
construct . 

20 

Advantageous regulatory sequences for the novel process are 
present, for example, in promoters such as cos, tac, trp, tet, 
trp-tet, lpp, lac, lpp-lac, lacl<3' T7, T5, T3, gal, trc, ara, 
SP6, K-P R or k-P L promoter and are advantageously used in 
Gram-negative bacteria. Further advantageous regulatory sequences 

25 are present, for example, in the Gram-positive promoters amy and 
SP02, in the yeast or fungal promoters ADC1, MFa , AC, P-60, 
CYC1, GAPDH, TEF, rp28, ADH or in the plant promoters CaMV/35S, 
SSU, OCS, lib4, usp, STLS1, B33, nos or in the ubiquitin or 
phaseolin promoter. Also advantageous in this connection are the 

30 promoters of pyruvate decarboxylase and of methanol oxidase from, 
for example, Hansenula. It is also possible to use artificial 
promoters for the regulation. 

It is possible in principle to use all natural promoters with 
35 their regulatory sequences like those mentioned above for the 
novel process. It is also possible and advantageous in addition 
to use synthetic promoters. 

The gene construct may, as described above, also comprise further 
40 genes which are to be inserted into the microorganisms. These 
genes can be inserted inside or outside the marker genes such as 
ura3, leu2, thr4 or kan. It is possible in principle for all 
types of genes to be inserted into microorganisms with the aid of 
the novel URA3 gene having the sequence SEO ID NO: 1 or its 
45 homologs. It is possible and advantageous to insert and express 
in host organisms regulatory genes such as genes for inducers, 
repressors or enzymes which intervene by their enzymatic activity 
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in the regulation, or one or more or all genes of a biosynthetic 
pathway such as the genes of riboflavin biosynthesis such as, for 
example, the rib genes or genes of biosynthetic pathways which 
lead to other fine chemicals, secondary metabolites or proteins, 
5 such as the genes of biotin, lysine, methionine, vitamin B12 or 
carotenoid biosynthesis, or genes which lead to flavorings, 
growth promoters or odoriferous substances, or individual genes 
for enzymes such as proteases or lipases, via the URA3 sequence. 
These genes can be heterologous or homologous in origin. The 
10 inserted genes may have their own promoter or else be under the 
control of the promoter of the sequence SEQ ID No. 1 or its 
homo logs. 

For expression in the abovementioned host organism, the gene 
construct is advantageously inserted into a vector such as, for 
example, a plasmid, a phage or other DNA, which makes optimal 
expression of the genes in the host possible. Examples of 
suitable plasmids are, in E. coli, pLG338, P ACYC184, P BR322, 
pUC 18, pUC 19, P KC30, P Rep4, pHSl, pHS2 , P PLc236, pMBI.24, pLG200, 
PUR290, P IN-III 113 -B1, \gtll or pBdCI, in Streptomyces, pIJlOl, 

20 pIJ364, pIJ702 or pIJ361, in Bacillus, pUBllO, pC194 or pBD214, 
in Corynebacterium, pSA77 or pAJ667, in fungi, pALSl, plL2 or 
PBB116, in yeasts, 2^M, pAG-1, YE P 6, y Ep l3 or P EMBLYe23, or, in 
plants, PLGV23, pGHlac-, P BIN19, pAK2004 or pDH51. Said plasmids 
represent a small selection from the possible plasmids. Further 

25 plasmids are well known to the skilled worker and can be found, 
for example, in the book Cloning Vectors (Eds. Pouwels P. H. et 
al. Elsevier, Amsterdam-New York-Oxford, 1985, 
ISBN 0 444 904018) . 

30 The gene construct advantageously comprises, for expression of 
the other genes present, additionally 3' and/or 5' terminal 
regulatory sequences to enhance expression, which are selected 
for optimal expression depending on the selected host organism 
and gene or genes. 

35 

Cfarfg^These regullatory sequences are intended to make specific 

----- 0 f the genes and of the [sic] protein expression 

iis may mean, depending on the host organism, for 
the gene is expressed or overexpressed only after 
r that it is immediately expressed and/or 
d. 

The regulatory sequences or factors may moreover preferably have 
a beneficial effect on expression of the introduced genes, and 
45 thus increase it. It is possible in this way for the regulatory 
elements to be enhanced advantageously at the transcription level 
by using strong transcription signals such as promoters and/or 



■J expression 
possible 
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enhancers. However, in addition, it is also possible to enhance 
translation by, for example, improving the stability of the mRNA. 

In a further embodiment of the vector, the novel gene construct 
can also be advantageously introduced in the form of a linear DNA 
into the microorganisms and be integrated into the genome of the 
host organism by heterologous or homologous recombination. This 
linear DNA can consist of a linearized plasmid or only of the 
gene construct as vector. 



10 



Host organisms suitable in principle for the novel gene construct 
are all prokaryotic or eukaryotic organisms. The host organisms 
advantageously used are microorganisms such as bacteria, fungi, 
yeasts, animal or plant cells. Fungi or yeasts are preferably 
15 used, particularly preferably fungi, very particularly preferably 
fungi of the family Metschnikowiaceae such as Eremothecium, 
Ashbya or Nematospora. 

The invention additionally relates to a process for producing 
20 uracil-auxotrophic microorganisms. To generate uracil-auxotrophic 
mutants, the orotidine-5 ' -phosphate decarboxylase gene having SEQ 
ID NO: 1 or its homologs are modified, for example by 
mutagenesis, in such a way that the protein encoded by the gene 
is inactivated. This inactivated gene is subsequently introduced 
25 into a microorganism, for example by transformation or 
electroporation. Finally, homologous recombination in the 
microorganisms results in auxotrophic mutants which can be 
screened via their resistance to 5-f luoroorotic acid (see Boeke 
et al., Mol. Gen. Genet., Vol. 197, 1984: 345 - 346). 




35 



he invention furtfrer relates to a process for inserting DNA into 
organisms, which comprises inserting into an organism, preferably 
a microorganism, wt.ich is deficient in an orotidine-5 ' -phosphate 
decarboxylase gene (= URA3 gene) a vector which comprises an 
intact URA3 gene having the sequence SEQ ID NO: 1 or its 
homologs, advantage iously together with further DNA, preferably 
with at least one other gene, and cultivating this organism on or 
in a culture medium which contains no uracil. Only these 
organisms which have acquired the vector are able to grow in this 
medium. A linear DKA is preferably used as vector in this 
40 process. The microorganisms preferably used in this process are 
fungi, especially of the family Metschnikowiaceae such as 
Eremothecium, Ashbya or Nematosprora [sic], particularly 
preferably microorganisms of the genus Ashbya. 



45 It is also possible to use as vector any suitable plasmid (but 
especially a plasmid which harbors the origin of replication of 
the 2m plasmid from S. cerevisiae) which undergoes autonomous 
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replication in the cell, but also, as described above, a linear 
DNA fragment which is integrated into the genome of the host* 
This integration can take place by heterologous or homologous 
recombination. But preferably, as mentioned, by homologous 
5 recombination (Steiner et al., Genetics, Vol. 14 0, 1995: 973 - 
987) . 

The novel URA3 gbne having the sequence SEQ ID NO: 1 or its 
homologs can advantageously be used as selection markers in the 
10 novel process. I c is possible and preferred to insert genes using 
this selection marker genes [sic] into Ashbya gossypii. 

An additional advantage is that on transformation of Ashbya 
gossypii it is possible to select with the aid of this gene, 
15 without the need to use foreign DNA (i.e. DNA not derived from 
Ashbya gossypii). 

It is possible on transformation of Ashbya gossypii with the gene 
having SEQ ID NO: 1 or its homologs also to insert any other 
20 genes. This makes it possible to construct strains which harbor 
single genes or a plurality of genes in several copies either on 
plasmids or in the genome. 

It is further possible to construct Ashbya strains in which 
25 chromosomal copies of genes have been destroyed by the insertion 
of the URA3 gene having SEQ ID NO: 1 or its homologs. 

A particular advantage of the AgURA3 gene is the possibility of 
using the marker several times in succession in the same strain. 

30 

If identical nucleotide sequences are placed 5' and 3' of the 
gene in the same orientation (so-called direct repeats), it is 
possible if required to delete the AgURA3 marker again by 
homologous recombination and selection on uracil- and 
35 FOA-containing medium, and then in another round insert 

additional DNA with the aid of this gene. Another advantage is 
the distinctly greater transformation efficiency by comparison 
with the markers thr, leu or kan. 

In the novel process, the vector comprises as other gene at least 
40 one gene of riboflavin synthesis. Genes of riboflavin synthesis 
mean those genes which are involved in synthesis in the entire 
metabolism of riboflavin producers such as Ashbya. 



45 
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Examples : 



Example 1 : 

5 Production of a genomic gene bank from Ashbya gossypii ATCC10895 
Genomic DNA from Ashbya gossypii ATCC10895 was prepared by the 



process described in 



10 al., Yeast, Vol. 2, 
Sambrook, J. et al. 
manual, Cold Spring 



WO97/03208. The genomic gene bank derived 



from this DNA was constructed in pRS314 and in YEp351 (Hill et 

1986: 163 - 167) by the method described in 
(1989) Molecular cloning: A laboratory 
Harbor Laboratory Press or in [lacuna] F.M. 
et al. (1994) Current protocols in molecular biology, John Wiley 
and Sons. As can be inferred from, for example, WO97/03208, other 
15 plasmids, such as plasmids of the pRS series (Sikorski and 

Hieter, Genetics, 1^89: 19-27) or cosmids such as, for example, 
SuperCosl (Stratagente, La Jolla, USA), are also suitable for 
producing the gene bank. 

20 Example 2 : 

It was initially attempted to clone the gene for the 
orotidine-5' -phosphate decarboxylase (= OMP-DC) from Ashbya 
gossypii via functional complementation of a corresponding 
25 URA3 -auxotrophic mutant of Saccharomyces cerevisiae. 

To this end, a gene bank was constructed from genomic Ashbya 
gossypii DNA in pRS314 (as described in Example 1). This DNA was 
used to transform the S. cerevisiae strain MW3317-21A (genotype: 

30 MAT a, trpl, ade8AKpn, ura3-52, hom3-10, met 13, met4, ade2, 
his3-Kpn, see, for example, Kramer et al., Mol. Cell. Biol. 9, 
1989: 4432-4440), by the lithium acetate method (see, for 
example, Kramer et al., Mol. Cell. Biol. 9, 1989: 4432-4440)- No 
clone in which the genomic deletion of the ura3 gene of the S. 

35 cerevisiae strain was complemented by a gene fragment from Ashbya 
was obtained. 

The attempt to clone the URA3 gene of Ashbya gossypii via 
functional complementation in a pyrF mutant of E. coli also 
40 failed. 

Example 3 : 

An attempt to clone the OMP-DC gene from Ashbya gossypii by 
hybridization with a fragment of the corresponding gene from 
45 Saccharomyces cerevisiae was also unsuccessful. 
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For this purpose, the complete URA3 gene from Saccharomyces 
cerevisiae (gene bank entry yscodcd) was used as probe (length 
1.1 kb) in order to screen a genomic cosmid gene bank from Ashbya 
gossypii (see Example 1). The experiment was carried out as 
5 described in Sambrook, J. et al. (1989) Molecular cloning: A 
laboratory manual, Cold Spring Harbor Laboratory Press or 
Ausubel, F.M. et al. (1994) Current protocols in molecular 
biology, John Wiley and Sons, using hybridization temperatures of 
52°C to 68°C. It was not possible to identify in the gene bank any 
10 clones which provided a positive signal with the URA3 gene from 
S. cerevisiae as probe. 



Example 4 : 

15 in the next approach, it was attempted to clone the gene for 
OMP-DC from Ashbya gossypii by amplification of gene fragments 
using degenerate oligonucleotides and the PCR technique. 

For this experiment, the known amino-acid sequences of the 
20 various orotidine-5 ' -phosphate decarboxylases from the following 
organisms were compared, and regions showing maximum conservation 
in all the enzymes were selected: 

Aspergillus niger (Acc. number: P07817) 

25 Aspergillus nidulans (Acc. number: P10652) 

Schizosaccharomyces pombe (Acc. number: P14965) 
Penicillium chrysogenum (Acc. number: P09463) 
Kluyveromyces lactis (Acc. number: P07922) 
Candida albicans (Acc. number: P13649) 

30 Neurospora crassa (Acc. number: P05035) 

Ustilago maydis (Acc. number: P15188) 
Saccharomyces cerevisiae (Acc. number: P03962) 
Drosophila melanogaster (Acc. number: Q01637) 
Mouse (Acc. number: P13439) 

35 Human (Acc. number: P11172) 

The numbers given in parentheses are derived from the 
SWISS&PIR-Translated Datenbank Release 103. 

40 Degenerate olgcinucleotides [sic] were synthesized on the basis of 
this inf ormatic >n . 

Degenerate oligonucleotides mean oligonucleotides in which 
mixtures of nucleotides have been incorporated at several 
45 positions during the synthesis. 



10 
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in this connection, R represents G or A, Y represents C or T, W 
represents A or T, M represents A or C, K represents G or T, S 
represents C or G, H represents A, C or T, V represents A, C or 
G, B represents C, G or T, D represents A, G or T, and N 
5 represents G, A, T or C. 

The following oligonucleotides were used: 

URA2-A: 5 ' -jjYTNGGNCCNTAY ATHTG Y — 3 ' 
URA3^If 5^i— .TAYTGYTGNCCNARYTTRTCNCC - 3 « 
URaRc: l^jWy YTNATHTTYGARGAYMGNAARTT- 3 ' 
URA3*"-?f a 5 ' -GCNARNARNARNARNCCNC-3 ' 

Using these oligonucleotides as primers, PCRs were carried out 
15 with genomic DNA from Ashbya gossypii as template. 

The following primer combinations were used: 

URA3-A + URA3-B; URA3-A + URA3-D; URA3C + URA3-B and URA3-C + 
20 URA3-D. 

The following hybridization temperatures were used: 
52°C, 48°C, 44°C, 40°C and 37°C. 

25 The products resulting from the PCRs were cloned by conventional 
methods into the vector pGEMT (Promega) and were sequenced. It 
was not possible to amplify any fragments which showed homology 
with the known OMP-DC genes mentioned above. 

30 

Example 5: 

A cDNA bank was constructed from Ashbya gossypii as described in 
DE 44 20 785 Al (Example 1). 

35 

Example 6 : 

Analysis of nucleic acid sequences in the gene bank 

40 Single clones were selected from E.coli clones which comprised 
the gene bank from Ashbya gossypii described in Example 5. The 
cells were cultivated by conventional methods in suitable media 
(e.g. Luria broth with 100 mg/1 ampicillin), and plasmid DNA was 
isolated from these cells. 

45 
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Oligonucleotides which hybridize in the vector portion were used 
as primers for sequencing the cDNA clones. Fragments of the 
cloned cDNAs were detected in this way. The sequences were 
analyzed as described in Example 7 . 

5 

Example 7 : 

A computer-assisted analysis of the nucleotide sequences found 
was carried out by comparisons of newly identified sequences with 

10 existing DNA and protein data banks using the following 

algorithms, e.g. with BLAST algorithms (Altschul et al. (1990) J. 
Mol. Biol. 215, 403-410), the Clustal algorithm with the aid of 
the PAM250 weighting table or the Wilbur-Lipman DNA alignment 
algorithm (as implemented, for example, in the program package 

15 MegAlign 3.06 supplied by DNAstar) . It was possible in this way 
to discover similarities of the newly discovered sequences with 
previously known sequences, and to describe the function of novel 
genes or part-sequences of genes. 

20 Example 8: 

Identification of E. coli clones which harbor the gene for OMP-DC 
from Ashbya gossypii (AgURA3). 

25 After examination of a large number of clones as described in 
Examples 6 and 7 (> 100 clones), a sequence which showed 
similarities with known OMP-DC genes was found. This homologous 
process was then used to screen the genomic Ashbya gene bank (see 
Example 1) once again, and it was possible to identify clones and 

30 cosmids which gave a specific positive signal and harbored a 
common 1.3 kb XhoI-EcoRI fragment. Sequencing of the clones 
produced the sequence as described in SEQ ID NO: 1. The sequence 
shows similarities with known URA3 genes and codes for a protein 
about 29246 Dalton in size. 

35 

Example 9: 

Disruption of the chromosomal copy of the AgURA3 gene with 
antibiotic resistance genes 

.40 

I ^Disruption of a gSne means destruction of the functionality of a 
" genomic copy of tha gene either by (a) deleting part of the gene 
sequence or by (b) of the [sic] interrupting the gene by 
introducing a piede of foreign DNA into the gene or by (c) 



45 replacing part of 



the gene by foreign DNA. Any foreign DNA can be 



used, but it is preferably a gene which effects resistance to any 



\ 
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3^ 



suitable chemical. Any suitable resistance genes can be used to 
disrupt genes. 

To disrupt the AgURA3 gene of Ashbya gossypii ATCC10895, the 
5 kanamycin 1 resistance gene from Tn903, which [lacuna] under the 



control oi 
pages 1793 
flanked 5 
endonucled 
10 which make 



the TEF promoter of Ashbya gossypii (see Yeast 10, 
-1808, 1994 or WO92/00379), was used. The gene is 

and 3 ' by several cleavage sites for restriction 
ses, so that it was possible to construct a cassette 

[sic] possible any desired constructions of gene 



disruptions using conventional methods of in vitro DNA 
manipulation . 

The internal 370 bp Pstl-Kpnl fragment of AgURA3 (position 442 - 
15 892 in sequence SEQ ID NO: 1) was replaced by a resistance 

cassette as outline above. The resulting construct was given the 
name ura3::G418. The resulting plasmid can be replicated after 
transformation into E.coli. The XhoI-SphI fragment of the 
construct ura3::G418 (see Figure 1) was purified by agarose gel 
20 electrophoresis and subsequent elution of the DNA from the gel 
(see Proc. Natl. Acad. Sci. USA 26, 615-619, 1979) and employed 
to transform Ashbya gossypii. Figure 1 shows in depiction A a 
restriction map of the coding region of the AgURA3 gene and of 
the 5'- and 3 ' -untranslated regions (= 5'-UTR and 3'-UTR). 
25 Depiction B shows the situation after insertion of the kanamycin 
resistance cassette described above (= TEF-kanR) . 

The fragment was transformed into Ashbya gossypii either by 
protoplast transformation (Gene 109, 99-105, 1991) or else, 
30 preferably, by electroporation (BioRad Gene Pulser, conditions: 
cuvettes with slit widths of 0.4 mm, 1500V, 25^F, 100Q) . The 
selection of transformed cells took place on G418-containing 
solid medium (WO 97/03208). 

35 Resulting G4lfe-resistant clones were examined by conventional 
methods of PC* and Southern blot analysis (Sambrook, J. et al. 
(1989) Molecular cloning: A laboratory manual, Cold Spring Harbor 
Laboratory Pross and in [lacuna] F.M. et al. (1994) Current 
protocols in nolecular biology, John Wiley and Sons) to find 
40 whether the gonomic copy of the AgURA3 gene was in fact 
destroyed. Clones whose AgURA3 gene was destroyed are 
uracil-auxotrophic and resistant to 1 mg/ml 5 ' -f luoroorotic acid 
(FOA) . 

45 
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Example 10: 

Disruption of the chromosomal copy of the AgURA3 gene without 
using antibiotic resistance genes 

5 

A particular advantage of the use of URA3 genes is the 
possibility of selection both for the presence and for the 
absence of the gene. It is possible to screen with FOA 
microorganisms which have a functionally inactivated URA3 gene, 
10 and by means of selection for uracil prototrophy to select for a 
functionally active URA3 gene. 

To disrupt the genomic copy of the URA3 gene, for the sake of 
simplicity an internal fragment (= PstI fragment) of the URA3 

15 gene was deleted from the coding region of the gene having the 
sequence SEQ ID NO: 1 (position 442 to 520 in sequence 
SEQ ID NO: 1). Transformation of Ashbya gossypii with this 
deleted ura3 fragment was carried out as described in Example 10. 
In place of deletion of part-regions of the gene, it is also 

20 possible in principle to use all other methods for inactivating 
the gene, such as mutations via insertions, duplications, 
reversions, replacement of several nucleotides or point 
mutations. Point mutations are less preferred because reversion 
thereof is easy. 

25 

The transformants were selected through resistance to FOA. In 
contrast to wild-type clones, clones which harbor a disruption of 
the AgURA3 gene are resistant to 1 mg/ml FOA. 

30 Example 11: 

Use of the AgURA3 gene for inserting further DNA into 
A. gossypii . 

35 The isocitrate lyase gene described in WO 97/03208 was inserted 
with the aid of the plasmid pAGlOO, as described in WO 97/03208 
(Example 4 and 5), into AgURA3 disruption mutants of A. gossypii 
(see Example 9 and 10), using as selection marker in A. gossypii 
the AgURA3 gene in place of the G418 resistance described. 

40 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) 



APPLICANT: 

(A) NAME: BASF Aktkengesellschaf t 
STREET: Carl fBosch Strasse 
CITY : Ludwigfehaf en 
FEDERAL STATE: Rheinland-Pf alz 
COUNTRY : Germany 
POSTAL COEJE: D-67056 



(B) 
(C) 
(D) 
(E) 
(F) 



(ii) 



(iii) 



(iv) 



TITLE OF APPLICATION: Orotidine-5 ' -phosphate decarboxylase 
gene, gene construct comprising this gene and its use 

NUMBER OF SEQUENCES: 2 

COMPUTER-READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMmJTER : IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 



(2) INFORMATION FOR SEQ ^D NO: 1: 
(i) 



SEQUENCE /CHARACTERISTICS: 

(A) /LENGTH: 1380 base pairs 

(B) / TYPE: nucleic acid 
(C y STRANDEDNESS : single 
( DA TOPOLOGY : linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) [Sic] AN^ISENSE: NO 

(vi) ORDINAL SOURCE: 

(A) ORGANISM: Ashbya gossypii 



(vii) 



(ix) 



2D I ATE SOURCE: 
(B) CLONE: ura3 

Features : 

(a) name /key: cds 

(b) location: 210. 



1013 
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(ix) FEATURES: 

(A) NAME /KEY : 5 ' pTR 

(B) LOCATION: l/. 199 

(ix) FEATURES: 

(A) NAME/KEY: 6'UTR 

(B) LOCATION: /1014. .1380 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CTCGAGCAAC TCATTGGAAG CCCTTCGCA^ ACGACCTCTA TATCTCGTCT CAAGTTCCTA 

CTATCATGTA TGCTGTCACT ACAGAAAAAT TTTTGTCTAT AGCTGGCAAG AAGCACATCA 

CATACATTCT GATGGTGTAG GCTCCAcAtC ACAGTAAGCA TTTGTATAAG GCTGATCACA 

TAGGGTGCTA CCGACCTAGC CATTGCf AC ATG TCA ACG AAA TCT TAC GCA GAA 

Met Ser Thr Lys Ser Tyr Ala Glu 
1 5 

AGG GCC AAG GCA CAC AAT TCg/ CCA GTT GCT AGA AAG CTT CTG GCA TTG 
Arg Ala Lys Ala His Asn Sef Pro Val Ala Arg Lys Leu Leu Ala Leu 
10 lfe 20 

ATG CAC GAG AAG AAA ACC aAt CTC TGC GCT TCC CTT GAT GTG CGG ACG 
Met His Glu Lys Lys Thr Afsn Leu Cys Ala Ser Leu Asp Val Arg Thr 
25 30 / 35 40 

TCT AGA AAG CTT CTG GAG /cTA GCA GAC ACG CTG GGA CCG CAC ATT TGT 
Ser Arg Lys Leu Leu Glu /Leu Ala Asp Thr Leu Gly Pro His He Cys 

45 / 50 55 

CTG CTG AAG ACA CAT GT^ GAC ATA CTG ACG GAC TTC GAC ATC GAG ACG 
Leu Leu Lys Thr His Val Asp He Leu Thr Asp Phe Asp He Glu Thr 
60 / 65 70 

ACA GTC AAG CCG CTG clvG CAG CTT GCG GCT AAG CAC AAC TTC ATG ATC 
Thr Val Lys Pro Leu Gin Gin Leu Ala Ala Lys His Asn Phe Met He 
75 I 80 85 

TTC GAG GAC CGC AAG ^TC GCT GAC ATT GGC AAC ACG GTT AAG CTG CAG 
Phe Glu Asp Arg Lys phe Ala Asp He Gly Asn Thr Val Lys Leu Gin 
90 I 95 100 

TAC TCC TCC GGC GTG ItAC CGT ATC GCG GAG TGG GCG GAT ATT ACC AAT 
Tyr Ser Ser Gly Val |ryr Arg He Ala Glu Trp Ala Asp He Thr Asn 
105 tlO H5 120 



60 
120 
180 
233 

281 

329 

377 

425 

473 

521 

569 
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GCA CAC GGC GTC ACC GGC CCC GGT GTG aJtA GCC GGG CTG AAG GAG GCT 
Ala His Gly Val Thr Gly Pro Gly Val lie Ala Gly Leu Lys Glu Ala 

125 A30 135 

GCG AAA CTG GCC TCA CAG GAA CCC AGG/ GGG TTG CTG ATG CTG GCA GAG 
Ala Lys Leu Ala Ser Gin Glu Pro Arfi Gly Leu Leu Met Leu Ala Glu 
140 lf5 150 

CTC TCT TCT CAG GGC TCT TTG GCG OGC GGA GAC TAT ACC GCG GGC GTC 
Leu Ser Ser Gin Gly Ser Leu Ala Arg Gly Asp Tyr Thr Ala Gly Val 
155 160 / 165 

GTT GAA ATG GCG AAG CTG GAC GAa/ GAC TTT GTG ATC GGG TTC ATC GCG 
Val Glu Met Ala Lys Leu Asp Glii Asp Phe Val He Gly Phe He Ala 
170 175 / 180 

CAG CGT GAC ATG GGT GGG CGT G^A GAC GGC TTT GAC TGG CTC ATC ATG 
Gin Arg Asp Met Gly Gly Arg Mia Asp Gly Phe Asp Trp Leu He Met 
185 190 / 195 200 

ACC CCG GGG GTT GGC CTG GAC Ig AC AAA GGA GAC GGC CTG GGC CAG CAG 
Thr Pro Gly Val Gly Leu Asp/ Asp Lys Gly Asp Gly Leu Gly Gin Gin 
205 / 210 215 

TAC CGC ACG GTG GAT GAG GTC GTC AGC GAC GGT ACC GAT GTG ATC ATT 
Tyr Arg Thr Val Asp Glu Vail Val Ser Asp Gly Thr Asp Val He He 
220 / 225 230 

GTT GGC AGA GGG CTC TTT ^AC AAG GGA AGA GAC CCC AAG GTC GAG GGT 
Val Gly Arg Gly Leu Phe Asp Lys Gly Arg Asp Pro Lys Val Glu Gly 
235 / 240 245 

GCC CGC TAC CGC AAG GCc/ GGT TGG GAG GCT TAC TTG CGC CGT ATG GGC 
Ala Arg Tyr Arg Lys All Gly Trp Glu Ala Tyr Leu Arg Arg Met Gly 
250 / 255 260 

GAG ACT TCG TAGTCTATC^ CTGGCGCCCA CAGTATATAG GCGGATTCCA 
Glu Thr Ser 
265 

CCGCCGATTA C CATC TC AG^ AACCTTTTTG TAATTATATG CCCCTATTGC CCTTATTTCC 
GAGCTGGTGC CGGGATCGGT TTATAGACGG GCAACAAGTT GATACTTTGT TCAGTAGCAT 
GCATCCAACA CTTGCAGGCT TGGGGTGTGG AAGGCCTCGC CGCGGATAAT TCGTATTACC 
CGCACTTCGT GAAGTATTbc TTTATGAAAA ATCTTCACTT TGGGCTAACT AGAGCCATAA 
CTCGACACAA GCCCCTTCJCT ACACACTTCG AGCTGGGACT AAAGTGACAA CGAATAGCAA 



617 

665 

713 

761 

809 

857 

905 

953 

1001 

1050 

1110 
1170 
1230 
1290 
1350 
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ATAATTAGCA AATATGGATG CGTTGAATTC 



1380 



(2) INFORMATION FOR SEQ ID NO: 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHi 267 amino acids 

(B) TYPE: kmino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE J protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Thr Lys Ser Tyr Ala fclu Arg Ala Lys Ala His Asn Ser Pro 
1 5/10 15 

val Ala Arg Lys Leu Leu Alaj Leu Met His Glu Lys Lys Thr Asn Leu 
20 / 25 30 

Cys Ala Ser Leu Asp Val Arg Thr Ser Arg Lys Leu Leu Glu Leu Ala 
35 / 40 45 

Asp Thr Leu Gly Pro His ile Cys Leu Leu Lys Thr His Val Asp lie 
50 / 55 60 

Leu Thr Asp Phe Asp lie/ Glu Thr Thr Val Lys Pro Leu Gin Gin Leu 
65 70 75 80 

Ala Ala Lys His Asn Phfe Met lie Phe Glu Asp Arg Lys Phe Ala Asp 

85 / 90 95 

lie Gly Asn Thr Val I/ys Leu Gin Tyr Ser Ser Gly Val Tyr Arg lie 
100 / 105 110 

Ala Glu Trp Ala Asp tie Thr Asn Ala His Gly Val Thr Gly Pro Gly 
115 / 120 125 

Val lie Ala Gly Leu/Lys Glu Ala Ala Lys Leu Ala Ser Gin Glu Pro 
130 / 135 140 

Arg Gly Leu Leu Met Leu Ala Glu Leu Ser Ser Gin Gly Ser Leu Ala 
145 / 150 155 160 



Arg Gly Asp Tyr Tnr Ala Gly Val Val Glu Met Ala Lys Leu Asp Glu 

1$5 170 175 



Asp Phe Val lie G 
180 



y Phe xle Ala Gin Arg Asp Met cily Gly Arg Ala 
185 190 
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Asp Gly Phe Asp Trp Leu lie Met 
195 200 

Lys Gly Asp Gly Leu Gly Gin Gin 
210 215 

Ser Asp Gly Thr Asp Val lie lie/ 
225 230 / 

Gly Arg Asp Pro Lys Val Glu iSly 

245 / 

Glu Ala Tyr Leu Arg Arg M«£t Gly 
260 / 



Thr Pyo Gly Val Gly Leu Asp Asp 
/ 205 

Ty/ Arg Thr Val Asp Glu Val Val 
/ 220 

Val Gly Arg Gly Leu Phe Asp Lys 
235 240 

Ala Arg Tyr Arg Lys Ala Gly Trp 
250 255 

Glu Thr Ser 
265 



